A Search for Best Error Metrics to Predict Discrimination of Original and Spectrally Altered Musical Instrument Sounds
Abstract
The correspondence of various spectral difference error metrics to human discrimination data was investigated. Time-varying harmonic amplitude data were obtained from the spectral analysis of eight musical instrument sounds (bassoon, clarinet, flute, horn, oboe, saxophone, trumpet, and violin). Sounds were resynthesized with various levels of random spectral alteration, ranging from 1 to 50%. Listeners were asked to discriminate the randomly altered sounds from reference sounds resynthesized from the original data. Several formulas designed to predict discrimination performance were then evaluated by calculating the correspondence between the discrimination data and the associated spectral difference measurements. Averaged over the eight instruments, the best correspondence was achieved using a spectral error metric based on linear harmonic amplitude differences normalized by rms amplitude and raised to a power a. While an optimum correspondence of 91% was achieved for a = 0.64, good correspondence occurred over a wide range of a. For linear harmonic amplitudes without rms normalization, good correspondence occurred within a narrower range, with a maximum correspondence of 88%. Correspondence was approximately 80% for decibel-amplitude differences, over an even narrower range. Other error metrics, such as those based on critical-band grouping of components, worked well but did not improve on the method based on harmonic amplitudes, and in some cases yielded worse results. Spectral differences computed from a small number of representative frames emphasizing attack and decay transients yielded slightly better results than those computed from all frames.
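The abstract names the ingredients of the best-performing metric (linear harmonic amplitude differences, rms normalization, a power a) but does not give the formula. The following is only a minimal sketch of one plausible reading, not the authors' definition: for each analysis frame, the absolute harmonic amplitude differences are raised to the power a, averaged over harmonics, taken to the 1/a root, and divided by the rms amplitude of the original frame; the per-frame values are then averaged. The function name, the argument layout, and the exact form of the normalization are all assumptions made for illustration.

```python
import math


def relative_spectral_error(original, altered, a=0.64):
    """Hypothetical rms-normalized spectral error between two sounds.

    original, altered: lists of frames; each frame is a list of linear
    harmonic amplitudes (same number of frames and harmonics in both).
    a: exponent applied to the amplitude differences (0.64 is the value
       the abstract reports as optimal; the formula itself is assumed).
    """
    frame_errors = []
    for frame_o, frame_x in zip(original, altered):
        k = len(frame_o)
        # rms amplitude of the original frame, used as the normalizer
        rms = math.sqrt(sum(v * v for v in frame_o) / k)
        if rms == 0.0:
            continue  # skip silent frames to avoid dividing by zero
        # mean of |difference|^a over harmonics, then the 1/a root
        mean_pow = sum(abs(o - x) ** a for o, x in zip(frame_o, frame_x)) / k
        frame_errors.append(mean_pow ** (1.0 / a) / rms)
    if not frame_errors:
        return 0.0
    # average the per-frame errors over all non-silent frames
    return sum(frame_errors) / len(frame_errors)
```

Under this reading the value is zero for identical spectra and is unchanged when both sounds are scaled by the same gain, which is the practical effect of the rms normalization. Restricting the input to a few representative frames, as the abstract describes, would only change which frames are passed in.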
Similar resources
Evaluation of Mel-Band and MFCC-Based Error Metrics for Correspondence to Discrimination of Spectrally Altered Musical Instrument Sounds
Several mel-band-based metrics and a single MFCC-based error metric were evaluated for best correspondence with human discrimination of single tones resynthesized from similar musical instrument time-varying spectra. Results show high levels of correspondence that are very close and often nearly identical to those found previously for harmonic and critical-band error metrics. The number of spec...
Detection of random alterations to time-varying musical instrument spectra.
The time-varying spectra of eight musical instrument sounds were randomly altered by a time-invariant process to determine how detection of spectral alteration varies with degree of alteration, instrument, musical experience, and spectral variation. Sounds were resynthesized with centroids equalized to the original sounds, with frequencies harmonically flattened, and with average spectral error...
Detection of Random Spectral Alterations of Sustained Musical Instrument Tones in Repeated Note Contexts
Eight sustained musical instrument sounds were randomly altered by a time-invariant process to determine how well spectral alteration could be detected on repeated notes. Sounds were resynthesized in a series of eight 0.25-second repeated notes and spectrally altered with average spectral alterations of 8, 16, 24, 32, and 48%. Listeners were asked to discriminate each randomly altered repeated ...
Discrimination of musical instrument sounds resynthesized with simplified spectrotemporal parameters.
The perceptual salience of several outstanding features of quasiharmonic, time-variant spectra was investigated in musical instrument sounds. Spectral analyses of sounds from seven musical instruments (clarinet, flute, oboe, trumpet, violin, harpsichord, and marimba) produced time-varying harmonic amplitude and frequency data. Six basic data simplifications and five combinations of them were ap...
Effect of critical band data reduction on musical instrument sounds
Time-varying spectra of three sets of musical instrument tones were data-reduced using two different methods based on critical bands. One method (used for Tests 1 and 2) preserved the average spectrum and rms amplitude-vs-time envelope for each band. The other method (used for Test 3) utilized critical-band smoothing of the spectral envelope. 20 subjects were tested for their ability to discrim...